Outline for an information theoretic search engine
Abstract
It is proposed that an information-theoretic search engine works like radar. The query words are the emitted signal, and the document database is the object to be detected. Various echoes come back from the database and, by analogy with echo cancellation, the echo with the lowest entropy is selected. In keeping with Shannon's theory, low-entropy documents are signal and higher-entropy documents are noise. Thus, my proposal separates signal from noise. The engine can be tuned to treat as many relevant documents as desired as signal.
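The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a document's entropy is measured, so this example assumes the Shannon entropy of each document's word-frequency distribution, and it assumes an "echo" is any document containing at least one query word. The names `shannon_entropy` and `rank_by_entropy` are hypothetical and do not come from the preprint.

```python
import math
from collections import Counter


def shannon_entropy(tokens):
    """Shannon entropy, in bits, of the empirical token distribution.

    Assumption: the preprint does not define its entropy measure;
    unigram word entropy is used here purely for illustration.
    """
    if not tokens:
        return 0.0
    counts = Counter(tokens)
    total = len(tokens)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def rank_by_entropy(query, documents, k=None):
    """Return (entropy, document) pairs, lowest entropy first.

    Assumption: a document counts as an "echo" if it shares at least
    one word with the query. `k` limits how many results are kept.
    """
    query_words = set(query.lower().split())
    echoes = []
    for doc in documents:
        tokens = doc.lower().split()
        if query_words & set(tokens):
            echoes.append((shannon_entropy(tokens), doc))
    echoes.sort(key=lambda pair: pair[0])
    return echoes if k is None else echoes[:k]


if __name__ == "__main__":
    docs = [
        "radar radar radar signal",
        "radar detects many distant moving objects",
        "nothing relevant here",
    ]
    for entropy, doc in rank_by_entropy("radar", docs):
        print(f"{entropy:.3f}  {doc}")
```

The parameter `k` plays the role of the tuning mentioned in the abstract: raising it admits more documents as signal, at the cost of admitting higher-entropy ones.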
Similar resources
Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines
Purpose: Traditionally, there have been many metrics for evaluating search engines; nevertheless, various researchers have proposed new metrics in recent years. Awareness of these new metrics is essential for conducting research on search engine evaluation. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating search engines. Methodology: This is ...
Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles
When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...
An Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
Towards a game-theoretic framework for text data retrieval
The task of text data retrieval has traditionally been defined as ranking a collection of text documents in response to a query. While this definition has enabled most research progress so far, it does not accurately model the actual retrieval task in a real search engine application, where users tend to be engaged in an interactive process with multiple queries, and optimizing the overall perfo...
Cha-Cha: A System for Organizing Intranet Search Results
Although search over World Wide Web pages has recently received much academic and commercial attention, surprisingly little research has been done on how to search the web pages within large, diverse intranets. Intranets contain the information associated with the internal workings of an organization. A standard search engine retrieves web pages that fall within a widely diverse range of inform...
Journal: PeerJ PrePrints
Volume: 3
Publication date: 2015